Newness and Givenness of Information : Automated Identification in Written Discourse

نویسندگان

  • Philip M. McCarthy
  • Christian F. Hempelmann
  • Zhiqiang Cai
  • Danielle S. McNamara
  • Arthur C. Graesser
چکیده

The identification of new versus given information within a text has been frequently investigated by researchers of language and discourse. Despite theoretical advances, an accurate computational method for assessing the degree to which a text contains new versus given information has not previously been implemented. This study discusses a variety of computational new/given systems and analyzes four typical expository and narrative texts against a widely accepted theory of new/given proposed by Prince (1981). Our findings suggest that a latent semantic analysis (LSA) based measure called span outperforms standard LSA in detecting both new and given information in text. Further, span outperforms standard LSA for distinguishing low versus high cohesion versions of text. Our results suggest that span may be a useful variable in a wide array of discourse analyses.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using LSA to Automatically Identify Givenness and Newness of Noun Phrases in Written Discourse

Identifying given and new information within a text has long been addressed as a research issue. However, there has previously been no accurate computational method for assessing the degree to which constituents in a text contain given versus new information. This study develops a method for automatically categorizing noun phrases into one of three categories of givenness/newness, using the tax...

متن کامل

Prominence variation beyond given/new

Prominence variation is known to be determined in part by discourse factors, such as the givenness or newness of the discourse entity being realized to the discourse. However, few empirical studies have been carried out to explain a wider range of phenomena occurring in natural speech corpora. In this study, corpus linguistics methods are applied to a task-oriented monologue corpus to show that...

متن کامل

Pre-focal givenness and accentuation in Estonian

A well-known factor affecting sentence prosody is Information Structure, including the givenness vs. newness of the information conveyed by a constituent. In many languages, givenness is expressed prosodically either by deaccentuation (e.g. [1]) or by a less prominent realisation of the accent, which may be achieved either by phonological means like accent type (e.g. [2]), or by phonetic means ...

متن کامل

Evaluating radio news intonation - autosegmental versus superpositional modelling

This study examines prosodic correlates of the givenness of discourse entities in German radio news speech. The material comes from the Stuttgart Radio News Corpus. Both GToBI intonation labels and a Fujisaki-style parametrization of the intonation contour were examined. We find strong word-class specific accentuation defaults; the influence of entity status is rather small and varies with word...

متن کامل

Informational Status and Pitch Accent Distribution in Spontaneous Dialogues in English

Revealing the relations between pitch accent types and the informational status of words requires a refined discourse analysis of spontaneous speech. A cooperative unscripted task in which subjects gave instructions for decorating Christmas trees successfully induced production of target adjective-noun pairs conveying new/given and contrastive information. Adapting Grosz and Sidner’s intention-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016